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Abstract 

Motivated by recent intensive experimental efforts on searching for neutrinoless double-beta decays, 
we perform a detailed analysis of the physics potential of the experiments based on ,6 Ge. Assuming no 
signals, current and future experiments could place a 90% lower limit on the half life Ty 2 > 4 x 10 26 yr 
and Ty 2 > 7 x 10 27 yr, respectively. Then, how to report an evidence for neutrinoless double-beta 
decays is addressed by following the Bayesian statistical approach. For the first time, we present a 
quantitative description of experimental power to distinguish between normal and inverted neutrino 
mass orderings. Taking an exposure of 10 4 kg • yr and a background rate of 10 -4 counts/(keV • kg • yr), 
we find that a moderate evidence for normal neutrino mass ordering (i.e., with a Bayes factor B given 
by ln(S) ~ 2.5 or a probability about 92.3% according to the Jeffreys scale) can be achieved if the true 
value of effective neutrino mass turns out to be below 0.01 eV. 
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1 Introduction 


Tremendous experimental efforts have been made to search for neutrinoless double-beta decays (Ov/3/3) 
in the past decades mm- The 0 v/3/3 process is of particular importance in particle physics, since it 
is currently the unique feasible way to establish whether neutrinos are their own antiparticles, namely, 
Majorana particles [3j. The Majorana nature of neutrinos clearly indicates the violation of lepton number, 
which is accidentally conserved in the standard model of elementary particles (SM). Furthermore, one can 
also extract the information on the absolute scale of neutrino masses and constrain the Majorana-type 
CP-violating phases from the observation of Oi'/3/3, which can not be obtained from neutrino oscillation 
experiments [4]. Hence, the establishment of Majorana nature of neutrinos, the determination of absolute 
neutrino masses, and the constraints on Majorana CP-violating phases will greatly help us explore the 
origin of neutrino masses and flavor mixing, and pave the way to new physics beyond the SM ESI. 

If neutrinos are indeed Majorana particles, it will be possible to measure a half life of some nuclear 
isotope in the Oz^/3/3 mode, i.e., N(A,Z) —>■ N(A,Z + 2) + 2e _ with A and Z being mass number and 
atomic number of the isotope, respectively. The half life is related to the effective neutrino mass via 

(r^ 2 y' = G 0u |.Ad 0! ,| 2 rripp , (1) 

where G 0l/ stands for the phase-space factor [Jj, and M 0l/ for the relevant nuclear matrix element (NME). 
Both the phase-space factor and NME can be calculated theoretically, although currently the latter still 
suffers large uncertainties from nuclear models [8]. In addition, the effective neutrino mass is defined as 
rripp = | U 2 1 m 1 + U 2 2 m 2 + U^ 3 m 3 \, where U ei (for £ = 1,2,3) denote the matrix elements in the first row of 
lepton flavor mixing matrix U, and m i (for i = 1,2,3) are neutrino masses. For Majorana neutrinos, the 
3x3 unitary matrix U is usually parametrized in terms of three mixing angles {# 12 )^ 13 )^ 23 }) one Dirac 
CP phase 5, and two Majorana CP phases Taking the standard parametrization of U [41 and 

properly redefining the charged-lepton fields, we obtain 

rripp = |mi cos 2 #12 cos 2 #i 3 e 2l< D + m 2 sin 2 #12 cos 2 d^e 21 ^ 2 + mn 3 sin 2 (9i31 , (2) 

where the mixing angle 0 23 and the Dirac CP-violating phase 5 turn out to be irrelevant. In Eq. ([2]). it 
has actually been assumed that the exchange of light Majorana neutrinos is the dominant mechanism for 
the 0v(3(3 process to take place, and any other new-physics contributions are negligible. The effective mass 
mpp connects the half life to the fundamental physical parameters, namely, neutrino mixing angles, 
neutrino masses, and Majorana CP-violating phases. As an interesting consequence of such a connection, 
one can determine the currently unresolved neutrino mass ordering by a precise measurement of . It 
should be emphasized that this discrimination of mass ordering is only possible under the assumption that 
neutrinos are Majorana particles. If neutrinos are of Dirac nature, no signals would be observed in 0 u/3/3 
experiments and thus no information on mass ordering can be extracted. Based on current knowledge of 
neutrino mixing angles and mass-squared differences [1], one can immediately find that if neutrinos have 
an inverted mass ordering (10), i.e., m 3 < m 1 < m 2 , fnpp has a lower limit map 1 ^ ~ 0.015 eV. No lower 
bound exists for normal neutrino mass ordering (NO), i.e., m 1 < m 2 < m 3 , and the minimum of mpp 
could be vanishingly small due to the intricate cancellation among three terms on the right-hand side of 
Eq. Pj). See, e.g., Ref. [15] . for a detailed analysis. If the cancellation indeed happens, it is impossible to 
determine the Majorana nature of neutrinos even in the far-future 0 u/3/3 experiments. 

As the determination of neutrino mass ordering is fundamentally important and very challenging for 
neutrino oscillation experiments, it is intriguing if the 0 u/3/3 experiments could provide valuable infor- 
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mation. At present, no observation of 0iz/3/3 leads to a lower bound on the half life Ty 2 , which can be 
translated into an upper bound m^g| exp on the effective neutrino mass, namely, < rngg| exp . Naively 
speaking, one may claim that the 10 is excluded if | exp < mgg||^ n is reached in future. Obviously, 
it is necessary to put such a claim on a solid statistical ground, and clarify how current and future 0 u/3/3 
experiments can distinguish 10 from NO. This is the primary motivation for the present work. 

There have been enormous experimental efforts spent on searching for CO/3/3, using various nuclear 
isotopes, such as 76 Ge, 136 Xe and 130 Te. See, e.g., Refs. u ei ei maim for recent reviews on this subject. 
In this paper, however, we restrict ourselves to the ' 6 Ge-based experiments. There are two main reasons 
for this choice. First, since the ' 6 Ge-based experiment possesses an excellent energy resolution, we have a 
fairly simple background, which has been found to be nearly flat in the signal region. Hence it is rather 
straightforward to perform projections on physics prospects of this kind of experiments. Second, our 
analysis will serve as a typical example for the other types of Ozz/3/3 experiments. Once their dominant 
backgrounds are well known, one can carry out a similar analysis of them as well0 

Currently, two 76 Ge-based Oz^/3/3 experiments, Gerda and Majorana Demonstrator [lTj, are com¬ 
missioning. In the first phase of Gerda, a lower limit of T®J 2 > 2.1 x 10 25 yr at 90% confidence level was 
reported for an exposure of 21.6 kg • yr and a background index (BI) about 10~ 2 [15], corresponding to a 
background rate of ICR 2 counts/(keV • kg • yr). The Gerda Phase-II and Majorana Demonstrator are 
expected to increase the exposure to about 200 kg • yr each, and to reduce the background down to the 
ICR 3 level at the same time. Such an improvement would increase the current lower limit on Jj°/ 2 by one 
order of magnitude, as shown in Ref. m- More excitingly, these two collaborations are also discussing the 
possibility of building a future large scale 76 Ge (LSGe) experiment together |17| . which may eventually 
reach an exposure of around 10 4 kg • yr, and a reduction of the background index by another order of 
magnitude, namely, BI = ICR 4 . Stimulated by these experimental activities, we will study the physics 
prospects for the ongoing and upcoming 76 Ge-based Oz^/3/3 experiments. In a recent work m, an analysis 
of the projected exclusion limit has been performed. Now we extend that work in two aspects. First, we 
include a detailed discussion on the experimental potential of discovering a non-null signal. Second, given 
the possibility of building a tonne-scale experiment LSGe in the future, we make the first attempt to find 
out at which level the currently undetermined neutrino mass ordering can be resolved then. 

The remaining part is structured as follows. To establish our notations and specify the framework 
of our discussions, we present some general remarks in Section 2, including a brief description of 'de¬ 
based CR/3/3 experiments, the implemented method for numerical simulations, and an introduction to the 
Bayesian analysis. In Section 3, we apply the Bayesian method to set up the exclusion limits, discover a 
non-null signal and discriminate neutrino mass orderings in current and future experiments, following the 
approach in Ref. m- Finally we summarize our main results in Section 4. 

2 General Remarks 

2.1 Description of 76 Ge-based experiments 

In a real Oz^/3/3 experiment, we determine the half life T®J 2 by collecting events in a narrow energy window, 
where the signals of our interest should appear. Given a total exposure £ (i.e., a product of the fiducial 

1 One may ignore the issue of background modelling by performing a “single-bin” rate analysis, as was done in m- 
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Figure 1: A simulated event spectrum of 0 v/3/3 for the 76 Ge-based experiment, where the red solid curve 
represents the expected Gaussian distribution of signal events on top of a constant background and typical 
values of the half life Ty 2 = 10 25 yr, the exposure £ = 50 kg-yr, the energy resolution FWHM = 3 keV, and 
the efficiency e = 0.65 have been used. The black-dashed horizontal line corresponds to the background 
with BI = 10 -2 . The gray histograms stand for the total number of both signal and background events, 
which is randomly generated according to the Poissonian distribution in each energy bin. 


detector mass and the live time) and a detection efficiency e, the expected number of signal events reads 


N 0v = In 2 • AG 


£ ■ e 


M ■ T 0u 
1V1 Ge 1 l/2 


(3) 


with N a = 6.022 x 10 23 being the Avogadro’s constant, and Ms e = 75.6 g the molar mass of ,6 Ge. 
Ideally, all the Oi'/3/3 events can be identified by the total energy of two final-state electrons, which should 
show up at the single energy scale 2039 keV, i.e., the Q- value of 0zyS/3 for ,6 Ge. However, because of a 
finite energy resolution, usually denoted by FWHM (Full Width Half Maximum), in realistic experiments, 
the observed signal events have a Gaussian distribution, with the mean at the Q-value and the width 
determined by FWHM. The ' 6 Ge-based Ozx/3/3 experiments have outstanding energy resolutions and high 
detection efficiencies. Although the exact values of these parameters vary for different experiments, they 
are close to each other. For simplicity, we choose FWHM = 3 keV and e = 0.65 as a benchmark point. 

Besides the signal events, backgrounds are also present in the signal region. Since the background rate 
varies among current and future experiments, we categorize them into two classes: the first one has a BI 
of 10 -3 for current experiments, while 10” 4 for the future LSGe experiment. In addition, we assume a 
flat background model in the energy window of interest, which is quite reasonable because of an excellent 
energy resolution. Regarding a realistic choice of the exposure, we follow Ref. m and take £ to be up to 
400 kg • yr for current experiments in total, while a maximum of 10 4 kg • yr for the future one. 

In Fig. [T] we show an example of the event spectrum at a 76 Ge-based experiment. The red solid 
curve represents the Gaussian signal events on top of a constant background (i.e., the black dashed line). 
Additionally, the gray histograms stand for the total events in each energy bin, which are randomly 
generated according to the Poissonian distribution. The data have been simulated by assuming a half 
life of T\J 2 = 10 25 yr, BI = 10“ 2 , and £ = 50 kg • yr. The benchmark values of the energy resolution 
and detection efficiency have been used, and the width of energy bins is 1 keV. The simulated number 
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of events, including both signals and backgrounds, is to be fitted by the model parameter (i.e., the half 
life T®J 2 or equivalently the effective neutrino mass m^), and the details of numerical simulations and 
statistical analysis will be given in the following two subsections. 

2.2 Numerical simulations 

As shown in Fig. [TJ we will generate a set of pseudo-data and perform projections with them. Following 
Gerda’s analysis on its first phase data m, we consider a region of spectrum that spans from 2022 keV to 
2061 keV with a bin size of 1 keV. The samples of pseudo-data are generated in two different ways. In the 
first one we simply let the number of events in each energy bin be the expected value from the background 
and signal functions. The data set produced in this way is known as the Asimov data m- Once the data 
are available, one can carry out a statistical analysis to test different hypotheses, or to estimate model 
parameters by fitting the data. 

We also produce the pseudo-data samples by using the Monte Carlo method. More explicitly, the 
events in each energy bin are randomly generated by assuming a Poissonian distribution with the same 
expected number of events as for the Asimov data. In this case, a large number of samples are first 
generated, and each of them is subsequently implemented to test hypotheses or extract model parameters. 
As a consequence, the statistical distributions of desired quantities can be obtained. This approach is 
referred to as an ensemble test. 

Apparently, no statistical fluctuations are considered in the Asimov data set. However, it has been 
found in Ref. m that the analysis of the Asimov data yields a good approximation to the median projection 
of experiments. Therefore, one can employ it to get some quick and qualitative results without the burden 
of analyzing lots of Monte Carlo samples, as it would be in the case of an ensemble test. For this reason, in 
the following discussions, we first perform a qualitative projection with the Asimov data set, and support 
the results by an ensemble test. Next, we present the detailed numerical results with the help of the 
MultiNest program [201 EH [22]. The sampling is efficiently carried out by using MultiNest, so is the 
Bayesian analysis of simulated data. 

2.3 Bayesian analysis 

Now we give a brief introduction to Bayesian statistical analysis. See, e.g., Refs. [23, 23], for more details. 
Bayesian analysis resides on the well-known Bayes’ theorem, and describes the degree of belief in a certain 
hypothesis 'H i (for i = 1, ...,r), given the data set V. Note that the hypotheses 7~L i are mutually exclusive, 
and only one of them is actually true. According to the Bayes’ theorem, the posterior probability of the 
hypothesis 7i i , denoted as PviTi^V), is given by 

w> - . (4) 

In the above equation, Pr('H i ) stands for the prior probability of the hypothesis 'H i , reflecting our prior 
degree of belief in such a hypothesis. Pr(T>\T-Li) is the probability of obtaining the data T>, assuming the 
hypothesis 7ii to be true, and is called the evidence Zi of the hypothesis 'Rj. Lastly, the overall probability 
of observing the data V is denoted by Pr(D), and it is equal to Pr(X > |'Hi)P r ('%j), because of the 

normalization condition JN =1 Pr(^j|P) = 1. 

One direct application of the above formalism is to make model selection. With the help of Eq. ([4]), 
one can compute the posterior odds of two competing hypotheses by taking the ratio of their posterior 
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ln(odds)| 

Odds 

Probability 

Interpretation 

< 1.0 

<3:1 

< 75.0% 

Inconclusive 

1.0 

~ 3 : 1 

~ 75.0% 

Weak evidence 

2.5 

~ 12 : 1 

~ 92.3% 

Moderate evidence 

5.0 

~ 150 : 1 

~ 99.3% 

Strong evidence 


Table 1: The Jeffreys scale used for the statistical interpretation of Bayes factors, posterior odds and 
model probabilities [27, [28]. 


probabilities, namely, 


Pr(^i|P) = Zj Pr (Hi) 
Pr(^|P) Zj Pr("Hj) ’ 


(5) 


where the ratio of evidences B = Z{/Zj is termed Bayes factor. In the case of no prior preference for 
any hypothesis (i.e., equal prior probabilities), the posterior odds is then directly reflected by the Bayes 
factor. Moreover, to interpret the value of this posterior odds or the Bayes factor, one often adopts the 
Kass-Raftery [25] or Jeffreys [26, [23 [28] scale. In Tabled we list the Jeffreys scale that used in [271 128]. 
and will implement them to interpret the results in the present work. 

Another realm of applying the Bayesian analysis is the parameter estimation. Now suppose that there 
exists a single hypothesis H that describes the data well, and this hypothesis is specified by a set of free 
parameters 0. The posterior probability distribution of 0 is then calculated as 


Pr(0| V,H) 


Pr(X>| e,H) Pr(0| H) 
Pr(2?| H) 


£(Q) vr(0) 
Z 


( 6 ) 


where the likelihood function Pr(U|0,'H) and the prior probability distribution Pr(0|%) are usually also 
denoted as £(0) and vr(0), respectively. Similarly, because of the normalization condition for the posterior 
probability, the evidence Z is found to be 


Z = 


Pr(T>|0, T~L) Pr(0|ft) d^© 


£(0) tt( 0) d^© , 


(7) 


where N is the dimension of parameter space. Note that in the parameter estimation, the evidence Z is 
a common factor for all parameters in 0, and thus can be ignored. This is not the case if some other 
hypotheses are also considered, as we have seen from the discussions of model selection. The posterior 
distribution of 0 then essentially depends on two factors, the likelihood function £(0) and the prior 
distribution 7r(0), and one may view this parameter estimation as an update on the prior distribution in 
light of new data. With the posterior distribution of the parameter, one can extract the most probable 
value and the credible regions in the standard way [25] . 

We now discuss the application of the above general formalism to the Ozz/3/3 experiments. For con¬ 
creteness, let us consider the analysis of setting up the exclusion limit, which belongs to the category of 
parameter estimation. The essential part of this analysis is to obtain the posterior distribution of the num¬ 
ber of signal events N 0u by fitting the simulated spectrum as given in Fig. [TJ In the fit, our parameters of 
interest 0 are the number of signal events N 0u and the possible background events B, i.e., 0 = {IV 0 *', B}. 
Both N 0u and B are drawn from some uniform prior distributions ir(N 0u , B). With each specified set 
of 0, we first calculate the expected number of events in each bin Awhere i is the bin index, 
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by assuming a Gaussian distribution of the signal events and a flat background (exactly in the same way 
as we generate the simulated spectrum in Fig. []]). Then, we compare the expected events A, with the 
simulated data events n, in each bin, and obtain the likelihood £(N 0u ,B) of this chosen set {N 0u , B} by 
assuming a Poisson distribution, i.e., 

xTLi 

C(N°\B) = H^-e- x % ( 8 ) 

h n i- 

where A r . is the total number of bins. With the prior distribution ir(N 0u , B ), we find the two-dimensional 
distribution Pr({iV 01 ', B}\T>) = C(N 0u , B) ir(N 0u , B)/Z, from which the posterior distribution of N 0u can 
be obtained by marginalizing over the background events, namely, Pr(iV 0 ! '|'D) = / Pt({N 0u , B}\T>) d B. 
Based on the posterior distribution, one can derive the exclusion limits on and via Eq. <[3|) and 
Eq. (P), respectively. 

For the claim of discovering the 0 v(3(3 decays and the discrimination of neutrino mass orderings, the 
same spirit as above will be followed, except that those two analyses belong to hypothesis test and the 
fitting parameters are different. For instance, the model parameters of our interest in the discrimination 
of neutrino mass orderings would be neutrino mixing parameters. In order to fit the data, however, one 
has to convert each set of neutrino mixing parameters to the expected number of signal events N 0l> via 
Eqs. (HD, © and Q- After doing so, we can similarly calculate the likelihood as defined in Eq. ([HD, and 
then obtain the posterior distribution for the chosen set of neutrino parameters. As now we are concerned 
with hypothesis test, in a final step, we integrate the product of the likelihood function and the prior 
distribution over the whole allowed range of neutrino parameters to obtain the evidence. By comparing 
the evidences of two competing hypotheses of neutrino mass orderings, we then know which one is more 
favorable, given a particular data set. 

3 Prospects for 76 Ge-based experiments 

3.1 Setting up the exclusion limit 

First, we consider how to set up the exclusion limits in current and future 76 Ge-based experiments. To do 
so, we generate the pseudo-data samples without signal contributions, and derive the posterior distribution 
of the number of signal events N 0u by fitting the data. A uniform prior on N 0u is assumed in the fit, 
and the background level is allowed to float. As already pointed out in Ref. SSI, including systematic 
uncertainties yields negligible changes to the final results, so we choose to ignore them as well. The 90% 
upper limit on the signal events A^g can be obtained according to 

Pr(N 0u \V) dN 0u = 0.90 , (9) 

where Pr(A^ 0 l/ |T > ) is the posterior distribution of N 0u . Furthermore, it is straightforward to translate A^ g 
into the 90% lower limit on T % 2 by using Eq. Q. 

In Fig. [21 we present the projected 90% lower limit on T % 2 as a function of the total exposure £, where 
the Asimov data are used. Four different background levels are considered, i.e., BI of 10 -2 (red dot), 10 “ 3 
(green triangle), 10 ~ 4 (blue square) and background free (black star). Some comments are in order: 

• An interesting observation is that for each background level, the projected lower limit (Ty 2 ) 0.9 has a 
good power-law dependency on the exposure, i.e., (Ty 2 ) 0.9 oc £“, where a is defined as the scalability 
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Figure 2: The 90% lower limit on T[% as a function of the total exposure in the case of the Asimov data. 
Four different background levels are considered, i.e., BI of 10 -2 (red dot), 10 -3 (green triangle), 10~ 4 (blue 
square) and background free (black star). The scalability index a is found by fitting each data set. 

index (SI). This index characterizes the gain on sensitivity when the experimental setup is expanded 
to a larger scale. From Fig. [21 one can see that a larger BI leads to a lower SI. Consequently, scaling 
two experimental setups by a common factor, one can conclude that the experiment that has a larger 
background would have less gain on the sensitivity. This demonstrates that both increasing the scale 
of experiments and reducing the backgrounds are very important to improve the sensitivity. 

• Moreover, one may also notice that the SI is ranging from 0.5 to 1 for different background levels. 
This can be understood in the following way. First of all, let us start with the background-free case. 
Since the pseudo-data samples are generated without signal contributions, there should be no events 
at all in all energy bins. For any values of the exposure, one actually fits the same empty data 
sample, so the upper limit IVq g remains unchanged. With such a fixed IVgg, we find that the lower 
limit on T|% is indeed proportional to the exposure, according to Eq. ([3]). Therefore, the SI of a = 1 
in the background-free case is understood. 

Then, we come to nonzero backgrounds. Since a data sample with backgrounds has sensitivities 
only to the signal that is large enough to manifest itself as the background fluctuation, an upper 
limit on N 0u can be obtained when it is on the same order of background fluctuations, which can be 
characterized by er bkgd , i.e., g ~ <r bkgd . For the value of <7 bkgd , one may take it to be the standard 
deviation <r bkgd = ^/IV bkgd . Since A r bkgd itself scales linearly with the exposure, we finally obtain 
(T^Q.g oc y/£. Thus, this explains a ~ 0.5 for BI = 10~ 2 that is large enough. For the intermediate 
values of BI, one can interpret the corresponding Si’s as they should interpolate between 0.5 and 1. 

The results from ensemble tests are given in Fig. [3l where we just consider the cases with BI of 10 -3 
and 10~ 4 that respectively represent the current and future experimental setups, and realistic ranges of 
the exposure are chosen. For both cases, the median projection on the lower limit of Ty 2 (black curve) 
together with its lcr (green) and 2<r (yellow) bands is mapped out. For comparison, we also depict the 
results from the Asimov data set, corresponding to the gray dashed curve. As one can see, the Asimov 
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Figure 3: The 90% lower limit on Ty 2 as a function of the total exposure in the case of ensemble test. 
Left (Right) panel corresponds to the background level of BI = 10~ 3 (10~ 4 ). The la and 2a bands are 
denoted by the green and yellow regions, while the median values are represented by the black curves. For 
comparison, we also show the results from the Asimov data set by gray dashed lines. 

data set yields rather good approximation to the median performance. Finally, one finds that for the 
current generation of experiment, a median exclusion limit of Ty 2 10 26 yr can be obtained, while 

Ty 2 > 7 x 10 27 yr for the future one. Our conclusions are rather close to those given in |16j . 

3.2 Discovering a non-null signal 

Then, we proceed with the analysis of discovering a non-null signal. Unlike the analysis of exclusion 
limits, we generate samples with the signal contributions, and analyze them so as to find out at which 
significance level the experiments can discover the 0 v/3/3 signal. In practice, the signal contributions in 
data are calculated by inputting different true values of Ty 2 - For each true value of T®J 2 we perform a 
Bayesian hypothesis test with the following two competing hypotheses: 

Ti : the sample spectrum is due to background only ; 

H. : the sample spectrum contains both signal and background contributions . 

The prior probabilities of H and H are assumed to be equal, and a moderate (strong) evidence for a signal 
will be claimed if the logarithm of the Bayes factor In B = In {Z^/Zy) is larger than 2.5 (5.0), according 
to the Jeffreys scale in Table [TJ 

As before, we first perform the analysis using the Asimov data set. In this case, since no statistical 
fluctuation is considered, a definite conclusion regarding moderate or strong evidence for a signal can be 
made for any true value of T®Y 2 . For a given set of exposure and background level, one only needs to present 
an upper limit on the true value of Ty 2 . below which one has about 50% probability of reporting a moderate 
or strong evidence for the signal. In Fig. |4l we present the detailed projection on the moderate and 

strong evidence limits on T^J 2 as a function of the exposure. Three different background levels, BI of 1CU 2 , 
10~ 3 and 10~ 4 , are considered. Moreover, in order to investigate the dependence on priors, we adopt two 
different scenarios, i.e., a uniform prior and a logarithmic prior both for l/T®J 2 € [1CU 29 ,10 -24 ] yr~ 4 i§ 
By carefully inspecting these results, we observe several interesting features. First, the dependence on 

2 Our choice of imposing a prior on 1 /T^J 2 is to follow the analysis done by GERDA [15]• In fact, it is equivalent to doing 
that on the signal event N 0 " according to Eq. ®. Moreover, for the fitting parameter, about which very limited experimental 


9 







Figure 4: Upper limits on Ty 2 , below which one has about 50% probability of reporting a moderate (top 
panel) or strong (bottom panel) evidence for the Ou/3/3 signal, as a function of the exposure. The results 
have been obtained by using the Asimov data. Three different background levels, i.e., BI of 10~ 2 (red 
dot), 10 -3 (green triangle) and 10 -4 (blue square), are considered. In addition, the left (right) panel 
corresponds to the case of a uniform (logarithmic) prior on 1 /T{% 2 in the fit. 

the priors of 1 /Ty 2 seems to be quite mild, which indicates our results are robust against the choices of 
priors. Second, for all cases, a good power-law relation between the required maximal value of Ty 2 and 
the exposure is observed, as in the previous case of exclusion limits. Lastly, the limits for moderate and 
strong evidence cases are quite similar, and only about 20% discrepancy between them has been observed. 

We next turn to the case of ensemble tests. For each true value of Ty 2 , one now needs to generate 
many samples using the Monte Carlo method. Because of statistical fluctuations, we may not reach a 
consensus about discovery among all samples. Therefore, instead of simply reporting an upper limit on 
T\J 2 . we have to associate a probability of reporting a moderate or strong evidence for each In Fig. [5l 
we present the contours of Ty 2 f° r ^ ve different probabilities of claiming a moderate or strong evidence 

information is currently available, we tend to choose a uniform prior if it is dimensionless, while considering both uniform 
and logarithmic priors if it has a dimension. Such a treatment is rather conservative and reflects our current ignorance of 
that parameter. Once the experimental information is available, one may easily incorporate the realistic prior (e.g., a normal 
distribution) into the analysis. 
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Figure 5: Upper limits on a t which one may report a moderate (top panel) or strong (bottom panel) 
evidence for the signal, as a function of the exposure in the case of ensemble test. A uniform prior on 
l/rjy" 2 is taken in the fit. Left (Right) panel corresponds to BI of 10 -3 and 10~ 4 , respectively. For each 
sub-figure, we show the contours that have probabilities of 5% (red), 25% (green), 50% (black), 75% 
(orange) and 95% (blue) to report a moderate or strong evidence. For comparison, the results from the 
Asimov data set are given by the gray dashed lines, which are almost indistinguishable from the 50% black 
solid curves. 


as a function of the exposure. Again, we only consider realistic choices of exposure and background levels. 
The prior on 1 /is chosen to be uniform, as it has been demonstrated that different priors yield almost 
identical results. 

From Fig. [5] we observe that the results from the Asimov data (gray dashed curves) agree very well 
with the contours that have 50% probability of reporting a moderate or strong evidence. Secondly, the 
contours for different probabilities also show good power-law relations with respect to the exposure. Their 
Si’s seem to be quite close. Finally, one can report a strong evidence for a half life of 10 26 yr with 

50% probability for current generation of experiments, while Ty 2 ~ 2 X 10 2 ' yr for the future ones. 

3.3 Discriminating neutrino mass orderings 

Finally, we are ready to deal with the important issue of discriminating two neutrino mass orderings in the 
' 6 Ge-based Or^/3/5 experiments. From previous discussions, we have already seen that current generation of 
experiments have a sensitivity only to ~ 10 26 yr, which is too small to tell two mass orderings apart. 
Therefore, in this subsection, we focus on the future generation of experiments, which has a BI of 10” 4 
and an exposure up to 10 4 kg • yr. 

In this analysis we are using the Asimov data, which should represent the median case of a full Monte 
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Carlo study. The signal contributions are included by taking various true values of T^J 2 . as done in the 
previous discovery analysis. Then, we perform the Bayesian hypothesis test and compare between the 
evidences of the NO and 10 hypotheses. The details of our calculations are summarized as follows: 

1. What we actually need to do is to find out whether NO or 10 is preferred, given a simulated 
data set V. This can be achieved by calculating the evidences i?NO = / Pr(D|0, NO)7r(0)d N 0 
and Z\q = / Pr(D|©, IO)7r(0)d N 0, where 0 now stands for neutrino mass and mixing parameters. 
Therefore, we start with the prior probabilities of neutrino parameters! i.e. , three lepton mixing 
angles 8 tJ (for ij = 12,13,23), three CP phases {5, ip 1 ,(p 2 }, two neutrino mass-squared differences 
and the lightest neutrino mass m 0 . For the mixing angles and mass-squared differences, which have 
been precisely measured, we assume Gaussian priors with the central values and lcr errors taken 
from Ref. [29]. For CP-violating phases, uniform priors are chosen. Lastly, we adopt two different 
priors for the lightest neutrino mass m 0 , i.e., a uniform prior within [0,0.2] eV and a logarithmic 
prior on [10 —5 ,0.2] eV. 

2. With neutrino parameters, we are able to calculate the corresponding effective neutrino mass m^ 

by using Eq. ([2]). For ,6 Ge decays, the phase-space factor G 0l/ = 2.63 x ICR 25 yr _1 eV~ 2 has been 
given in Refs. 0 [30]. In order to calculate in Eq. (0) and eventually the number of signal 
events in Eq. Q, we have to specify the values of NME on which currently there exist large 

uncertainties. For illustration, we consider two limiting cases, namely, = 4.6 referring to as the 
“Low NME” case while M. 0l/ = 5.8 as the “High NME” case! Putting all together, we compute 
the evidences for Z NO an d ^io ' n the NO and IO cases, respectively. Since we assume equal priors 
for these two hypotheses, the ratio of their posterior probability coincides with the ratio of their 
evidences, i.e., the Bayes factor B = Z NO /Z lo . Finally, one then uses the Jeffreys scale in Tabled] 
to interpret such a ratio, and figure out the level of preference for neutrino mass ordering. 

The primary results in this paper have been summarized in Fig. [6] where the contours of Bayes factors 
in the plane of and £ are given. The left and right panels correspond to the uniform and logarithmic 
priors on the lightest neutrino mass m 0 , respectively. Given a true value of rripa and an exposure £, we 
can calculate the Bayes factor B. Then, the contours of ln(£>) in the plane of and £ are plotted as 
thick solid (dashed) curves for the “Low (High) NME” case. The black and orange curves stand for the 
contours of ln(H) = 1 and 2.5, respectively. In addition, there exists two blue curves for ln(R) = —1 in 
the upper part of the right panel. In the chosen ranges of T^'L and £, we have not found the contours of 
ln(R) = 5 and —2.5, which implies a limited power of discrimination between NO and IO even in a future 
experiment. Lastly, the red horizontal line is for rripp |]^ n that calculated by using the best-fit values of 
neutrino mixing parameters, while the dark (light) band is for the result by using la (3a) ranges. 

After clarifying the conventions in Fig. [6l we are now at the position of drawing some physical impli¬ 
cations from it. First of all, it is evident that the results depend on the prior distribution of the lightest 
neutrino mass m 0 . If the true rripp is found to be large and well above the minimum rripp\^ in , both IO and 
NO can explain such a large value, when a uniform prior on m 0 is assumed. However, in the logarithmic 
prior case, the evidence will receive significant contributions from the region of much smaller m 0 , for which 

3 In fact, we do not need d 2 3 and <5, as the effective neutrino mass rripp defined in Eq. ([2| does not depend on them. 

4 Here we follow the same treatment on NME as that given in Ref. [16], where NME is restricted in the range between 
4.6 and 5 . 8 . This range covers most of the calculations available in the literature, except for that in the nuclear shell model, 
where a rather small value of 2.3 is predicted [8j. Adopting the result from the nuclear shell model, we find that the contours 
of the Bayes factor in Fig. [ 6 ]would shift further to the right, decreasing the discriminating power on neutrino mass orderings. 
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Figure 6: Contours of the Bayes factor In B = ln(iJ NO /Zio) calculated by comparing between NO with 10, 
where the thick solid (dashed) curves correspond to the low NME _M 0l , = 4.6 (the high NME = 5.8). 
The orange, black and blue curves represent ln(£>) = 2.5, 1 and —1, respectively. 

10 will result in a larger rripp than NO. Therefore, 10 is slightly preferred in this case, as indicated by the 
blue curves in the right panel. 

However, when the true value of rripp turns out to be close to rripp |J^ n or even smaller, the dependence 
on the priors of m 0 becomes much weaker, and the results from both uniform and logarithmic priors tend 
to converge. In the following, we focus on this case, in which the results will not be much affected by the 
choices of priors. One can observe that both the contours of ln(£>) = 1 and 2.5 indicate a preference for NO 
over 10. An immediate question is at which level we can exclude 10 experimentally. In this connection, 
two comments are in order. 

First, as indicated by these contours, it is now evident that the exposure at least 500 (or 2500) kg • yr 
needs to be accumulated in order to report a weak (or moderate) evidence for NO. According to the 
Jeffreys scale in Tabled! the weak and strong evidence should be understood as a degree of belief of 75.0% 
and 92.3%, respectively. This is due to the fact that when the exposure is too low, even though the true 
rripp is much smaller than rripp |J^ n , it cannot be precisely reconstructed from the data. As an example, 
we consider two different exposures, i.e., £ = 500 kg • yr and 10 4 kg • yr. In each case we first generate 
the data sample with the true value of T\J 2 being 2 x 10 28 yr, which corresponds to rripp ~ 0.01 eV below 
m^IrSn according to Eq. (JTJ) . We then fit these data samples with test values of Ty 2 , and show their 
likelihood functions (red and green curves) for both exposures in Fig. [TJ where we also draw the upper 
bound of Ty 2 in IO (vertical dashed line) in the “Low NME” case. As one can see, in the £ = 500 kg • yr 
case, although now the true value of Ty 2 above the IO upper bound, the likelihood function of the 
fitted Ty 2 spreads out into the region allowed by IO. As a result, IO still has a non-negligible probability 
of explaining the data, weakening the preference of NO over IO. In contrast, when £ = 10 4 kg • yr, the 
probability of explaining the data with in the region allowed by IO is much smaller, resulting in a 
higher degree of belief of preferring NO over IO. These findings agree with what we have observed by using 
a full statistical analysis in Fig. [6l 
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Figure 7: The likelihood function of the fitted gi yen the true value of Ty 2 being 2 x 10 28 yr. Red 
and green curves correspond to two different exposures 500 kg • yr and 10 4 kg • yr, respectively, while the 
vertical dashed line indicates the upper bound of Ty 2 in 10 assuming the “Low NME” case. Background 
index is kept to be 10~ 4 , and the other experimental parameters are the same as before. 

Second, in the future generation of ,6 Ge-based Oz^/3/3 experiments with an exposure of 10 4 kg-yr, we are 
able to reach a value of In(6) > 2.5, indicating a moderate evidence for NO. This conclusion is viable for 
both uniform and logarithmic priors. Fig. [6] also confirms the fact that if the true rripp is above 
then it is impossible to significantly distinguish between NO and 10 within the 0 is/3/3 experiments. 

4 Summary and conclusion 

Current intensive experimental efforts on searching for 0 is/3(3 motivates us to perform an analysis of their 
physics prospects. In particular, we focus in the present work on the ,6 Ge-based Oz^/3/3 experiments, for 
its relatively simple background model. Three important aspects of this type of experiments are studied. 
First, we consider to set up the exclusion limits on the half life. It has been found that for the current 
generation of experiments (i.e., £ = 400 kg-yr and BI = 10~ 3 ) we can reach a half life of 4 x 10 26 yr, while 
7x 10 2 ' yr for the future one (i.e., £ = 10 4 kg • yr and BI = 10~ 4 ). Second, we investigate the experimental 
ability of discovering a non-null signal. Our results indicate that if the true half life is below 10 26 yr, 
one can report a strong evidence (or a degree of belief of 99.3%) for it with at least 50% probability in 
the current generation of ,6 Ge-based experiments. For the future generation of experiments, the above 
limit can be increased to ~ 2 x 10 27 yr. Lastly, we perform a statistical study on at which level the 
future generation of ,6 Ge-based experiments can discriminate two neutrino mass orderings. The primary 
results are given in Fig. [6l We find that for those experiments that have a very low background level of 
BI = 10” 4 , the exposure has to be at least ~ 500 (~ 2500) kg • yr before having the sensitivity to report 
a weak (moderate) evidence for NO against IO. According to the Jeffreys scale in Table [H the weak and 
moderate evidences should be understood as a degree of belief of 75.0% and 92.3%, respectively. However, 
even in the future generation of experiments with an exposure of 10 4 kg • yr, a strong evidence for neutrino 
mass ordering cannot be obtained. In that case, we may achieve a definite conclusion on neutrino mass 
ordering by combining both 76 Ge-based and other 0 v/3/3 experiments together. Nevertheless, the 0 v/3/3 
experiments serve as another independent probe to neutrino mass ordering, which is complementary to 
the neutrino oscillation experiments 131 ’ . 
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Although we have concentrated on the 76 Ge-based experiments, it should be emphasized that our 
analysis can directly be applied to other 0 u/3/3 experiments with distinct nuclear isotopes. Furthermore, 
a combined analysis of all types of experiments, as done in Refs. [321 f30|. will be helpful in extracting the 
valuable information not only on neutrino mass ordering but also on the absolute neutrino mass scale and 
Majorana CP-violating phases. 
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